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Abstract 

Any symmetric affinity function w : V x V — ► M+ defined on a 
discrete set V induces Euclidean space structure on V. In particular, 
an undirected graph specified by an affinity (or adjacency ) matrix can 
be considered as a metric topological space. We have calculated the 
visual representations of the probabilistic locus for a chain, a polyhe- 
dron, and a finite 2-dimensional lattice. 



1 Introduction 

Flatness is the essential property of Euclidean space. In particular, there 
is no distinguished point that serves as an origin, because it can be translated 
anywhere. This affine-geometric property of Euclidean space is generalized 
by the affine group of all invertible affine transformations from the space into 
itself consisting of a linear transformation described by a matrix A followed 
by a translation, 

y = Ax. + a. 
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It is known that the affine geometry keeps the concepts of straight lines and 
parallel lines but not those of distance between points or value of angles, pQ . 

From a probabilistic point of view, the fact that there is no canonical 
choice of the origin can be interpreted as if it can be at any point in the 
space with a uniform probability. The interesting question arises when the 
probability tc x > to find the origin at the particular point x G V of some 
discrete set V of N points is not uniform, 
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- What does such a space look like? 

One way to describe the disparity between points in the set V is by 
introducing an affinity function w : V x V — > 1R+ U {0} encoding their 
pairwise relations or neighborhoods - two points x G V and y G V are 
neighbors, x ~ y, iff w xy > 0, but w xy = otherwise. Then, the probability 
distribution 7r is given by 

*' = T {) 

The affinity function w xy > defines an undirected weighted graph G(V,E), 
in which V is the set of nodes and E C V x V is the set of edges describing 
the relation between pairs of nodes. 

The aim of this paper is to investigate the properties of the probabilistic 
locus satisfying ([T]) (a graph) specified by the arbitrary symmetric, non- 
degenerated matrix w xy > and to show that it can be regarded as the 
projective space PIR^ -1 ) of homogeneous coordinates obtained by projection 
from the origin n. Given a random walk defined on the graph G(V, E), the 
projective geometry can be reduced to the metric geometry, which is realized 
by the introduction of a Euclidean scalar product, a distance, and a norm 
which can be directly related to the first-access times of nodes by random 
walkers. 

The paper is organized as follows. In (Sec. [2]), we introduce the proba- 
bilistic analog of affinity transformations that can be interpreted as random 
walks defined on the set V with the structure determined by the matrix 
w xy > 0. In (Sec. [3]), we consider the probabilistic projective geometry for 
the space projected from the distinguished probability distribution tc induced 
by a simple stochastic process - a random walk - instead a distinguished point 
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as in the usual projective geometry. In (Sec. HJ), we perform a reduction from 
the probabilistic projective geometry to the probabilistic metric geometry by 
introducing the Euclidean space structure associated to the diffusion process 
defined on V with respect to w xy > 0. Then, in (Sec. Ej), we demonstrate that 
the Euclidean characteristics can be interpreted in terms of the first-access 
properties of nodes by random walkers. Using the metric structure intro- 
duced in Sec. IU we introduce a probabilistic topological space in Sec. El We 
conclude in Sec. [7] and give three visual representations of the probabilistic 
locus of a chain, a polyhedron, and a finite 2-dimensional lattice. 



2 Probabilistic analog of affinity transforma- 
tions 

In a geometric setting, affine transformations [2] are precisely the func- 
tions that map straight lines to straight lines, i.e. preserves all linear com- 
bination in which the sum of the coefficients is 1. Their probabilistic analog 
in the class of stochastic matrices, ^2 y& yT xy = 1, is given by the transition 
probability operators, 

T xy = (1 - &) 6 xy + J? xWxy (3) 

l^ijev Wx v 

where (3 X G [0, 1] and w xy > is the symmetric affinity matrix describing the 
set of paths available from x G V. 

If points of the discrete set V are interpreted as nodes of an undirected 
graph, then the transition operator ([3]) describes a "lazy" random walk on 
G(V, E) specified by the "laziness" parameters /3 X , such that a random walker 
stays in x G V with probability 1 — j3 x , but moves to another node randomly 
chosen among the nearest neighbors with probability (3 X / Y^ y w yx . In partic- 
ular, if (3 X = 1 uniformly for all x G V, the operator describes the usual 
random walks extensively studied in the classical surveys [3],[1]. 

Random walks provide us with an effective tool for the detailed structural 
analysis of connected undirected graphs exposing their symmetries. In [5], 
we have shown that the transition operator ([3]) gives the representation of 
the group of linear automorphisms of the undirected graph G(V,E), in the 
class of stochastic matrices. 

Discrete translational symmetry is obviously broken if the graph G is 
either finite, or irregular. If the graph is finite but regular, then the transition 
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probability operator Q defines a symmetric Markov chain, in which the 
probability of moving to y G V given that a walker is at the node x G V is the 
same as the probability of moving backward. Finally, for a non-regular graph 
G this property is replaced by time-reversibility that can be conveniently 
formulated in terms of the stationary random walks, 

T^x^xy T^yTyxi (4) 

for every pair x, y G V. It is well known (see [3]) that the stationary distribu- 
tion of random walks it defined on the undirected graph G is given by ([2]) and 
is the left eigenvector of the transition operator belonging to its primary 
Perron eigenvalue \i = 1. Since for any choice of /3 X G [0, 1] the matrix T xy > 
is a real positive stochastic matrix, it follows from the Perron- Frobenius the- 
orem (see [8]) that its maximal eigenvalue /x = 1 is simple if the graph G is 
connected. Any other distribution a defined on V such that a x > for any 
x G V and XLev ° x = ^> as y m Ptotically tends to the stationary distribution 
7r under the iterative actions of the transition operator (j3J), 

n = lim oT n . (5) 

n^oo 

The self-adjoint operator associated to T is given by 

f =- (tt^TtT 1 / 2 + tt-^TV/ 2 ) , (6) 

where T T is the adjoint operator, and it is defined as the diagonal matrix 
diag(7Ti, . . . , 7Tjv). While interesting in the spectral calculations of random 
walks characteristics, the symmetric matrix ([6]) is more convenient since its 
eigenvalues are real and bounded in the interval /i G [— 1, 1] and the eigen- 
vectors define an orthonormal basis. 



3 Probabilistic projective geometry 

An affine coordinate system on V is prescribed by an affinely indepen- 
dent set of points for which the displacement vectors e y = y — x, y G V, 
x y, form a basis of V with respect to the point x G V. A displace- 
ment vector v = J2 y ev v v e y ^ s identified with the coordinate (N — l)-tuple 
{y i, . . . , { } x , . . . , Vjy), in which the x— th component is missing. We can as- 
sociate all points V with their relative displacement vectors. 
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This is the distinguished probability distribution ir induced by a simple 
stochastic process - a random walk - is important for the defining an ele- 
mentary coordinate system in the probabilistic projective space. Given a 
symmetric matrix w xy > and a vector j3 x G [0, 1], we can define the transi- 
tion probability by the kernel (j3J) on V and its self-adjoint counterpart (jSJ). 
The complete set of real eigenvectors \& = {/0i> "02? ■ • •V'iv} of the symmetric 
matrix (JBJ), 



ordered in accordance to their eigenvalues, //i = 1 > //2 > ■ ■ ■ Hn > — 1) 
forms an orthonormal basis in R , 



associated to linear automorphisms of the affinity matrix iy ZJ/ , [5]. In ([7]), we 
have used Dirac's bra-ket notations especially convenient for working with 
inner products and rank-one operators in Hilbert space. 

Given the random walk defined by the operator ([3]), then the squared 
components of the eigenvectors \I/ have very clear probabilistic interpreta- 
tions. The first eigenvector ipi belonging to the largest eigenvalue fix = 1 
satisfies ipf x = tt x and describes the probability to find a random walker in 

x G V. The norm in the orthogonal complement of ipi, Yl^^fx = 1 — n x, 
is nothing else but the probability that a random walker is not in x. 

Looking back it is easy to see that the transition operator defines 
a projective transformation on the set V such that all vectors in R^fV) 
collinear to the stationary distribution ir > ([2]) are projected onto a common 
image point. 

Geometric objects, such as points, lines, or planes, can be given a rep- 
resentation as elements in projective spaces based on homogeneous coordi- 
nates, jp. Any vector of the Euclidean space M. N can be expanded into 
v = J2k=i ( v l0fc) (V^Ij as we U as i n to the basis vectors 



\ipi)T = fa 



(ij>x\i>v) = s - 



(7) 




s = 



(8) 



which span the projective space PR' 



N 



V7T- 1 / 2 = £ <v|Vfc> (ft 



k=2 
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since we have always tpi tX = y/W^ > for any x G V. The set of all iso- 
lated vertices p of the graph G(V, E) for which tt p = play the role of the 
plane at infinity, away from which we can use the basis as an ordinary 
Cartesian system. The transition to the homogeneous coordinates (jSJ) trans- 
forms vectors of ~R N into vectors on the (N — l)-dimensional hyper-surface 
{^i,a; = \/ 71 x }, the orthogonal complement to the vector of stationary dis- 
tribution 71. 



4 Reduction to metric geometry 

The key observation is that in homogeneous coordinates the operator 
T k (jv _ i) defined on the (N — 1) -dimensional hyper-surface {ipi iX = tt x } 
determines a contractive discrete-time affine dynamical system. The origin 
is the only fixed point of the map T k ,„ 

limn = (1,0,... 0), (9) 

n— >oo 

for any £ G PM^ -1 ^ and the solutions consist in the linear system of points 
T n £ that hop in the phase space (see Fig. [[]) along the curves formed by 
collections of points that map into themselves under the consecutive action 
of f. 

The problem of random walks ( |3ll6l) defined on finite undirected graphs 
can be related to a diffusion process which describes the dynamics of a large 
number of random walkers. The symmetric diffusion process correspondent 
to the self-adjoint transition operator T describes the time evolution of the 
normalized expected number of random walkers, n(t) 7r -1 / 2 gVxN, 

n = Ln, L = 1 - f (10) 

where L is the normalized Laplace operator. Eigenvalues of L are simply 
related to that of T, = 1 — fi^, k — 1, . . . , N, and the eigenvectors of both 
operators are identical. The analysis of spectral properties of the operator 
ffTU]) is widely used in the spectral graph theory, [B]. 

It is important to note that the normalized Laplace operator (jTOl) defined 
on PRi N - 1} is invertible, 



(11) 
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Figure 1: Any vector £ £ PM.^ ^ asymptotically approaches the origin 7r under 
the consecutive actions of the operator T. 

(jv-i) * s a contraction mapping for any k > 1. The unique inverse 



since T fc 
operator, 



1 = E j t ! i^ l . ( 12 ) 

-L Lie 



is the Green function (or the Fredholm kernel) describing long-range inter- 
actions between eigenmodes of the diffusion process induced by the graph 
structure. The convolution with the Green's function gives solutions to in- 
homogeneous Laplace equations. 

In order to apply metric geometry to the graph G(V,E), one needs to 
introduce the distances between points (nodes of the graph) and the angles 
between lines or vectors that can be done by determining the inner product 
between any two vectors £ and ( in PR^^ as 

(£,CW) = (e,^c)- (is) 

The dot product (fT3l) is a symmetric real valued scalar function that allows 
us to define the (squared) norm of a vector £ G PM.^ ^ with respect to 
(w, (3) by 

U\\\ W ,P) = U,L-^). (14) 
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The (non-obtuse) angle 9 G [0, 180°] between two vectors is then given by 

a ( (^0(w,/3) \ 

= arccos ^— j . (15) 

\ II ? \\(w,p) II C \\( w ,p) J 

The Euclidean distance between two vectors in PIR^ -1 -* with respect to 
(w,/3) is defined by 

= p e (^_o+Pf(e-o 1 J 

where P c (£ - C) = H II and P < K ~ = II C II^HC C)( WJJ) 

are the lengths of the projections of (£ — £) onto the unit vectors in the 
directions of £ and ( respectively. It is clear that P^ (£ — £) = (£ — C) = 

if e = c- 

It is worth mentioning that the spectral representations of the Euclidean 
structure (I13H16P defined for the graph nodes can be easily derived by taking 
into account that (x \ip' s ) = ip' sx . 



5 Euclidean structure and the first-access 
characteristic times of graph nodes 

The structure of Euclidean space introduced in the previous section can be 
related to a length structure V x V — > P + defined on a class of all admissible 
paths V between pairs of nodes in G. It is clear that every path P(x, y) 6 V 
is characterized by some probability to be followed by a random walker which 
in particular depends upon the weights w pq > of all edges p ~ q that join 
it. Therefore, the path length statistics is a natural candidate for the length 
structure on G. 

The theory of random walks on graphs offers the concepts to quantify the 
mutual accessibility of nodes in a graph. The access time or hitting time h xy 
is the expected number of steps before node y is visited, starting from node 
x, [3]. Since the first step takes a walker to a neighbor v ~ x and then the 
walker has to reach y from there, the hitting time should be additive, 

h xy = l+ kvy (17) 

Z^zev Wxz 
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if x 7^ y and h xx = 0. In general h xy is not a symmetric matrix except if the 
graph has a vertex-transitive automorphism group, [3]. 
The sum 

d(x, y) = h xy + h yx (18) 

is called the commute time: this is the expected number of steps in a random 
walk starting at x before node y is visited and then node x is reached again, 

The expected number of steps to first arrival of a random walker at x G V 
starting from another node randomly chosen with respect to the probability 

fx ^ ^ ^yhy X , (19) 

ydV 

is called the first-passage time to x. 

All three quantities are well-known in the theory of random walks on 
graphs and can be calculated in the standard way, [3j. The most important 
observation for us is that their spectral representations, 

hxy = Ef=2 M«-^X») 2 /(1-A*.), 

d(x,y) = Ef= 2 «,-<,) 2 /(l-^), (20) 
/. = Ef= 2 <, / (1 - M.) , 

coincide with those of the Euclidean quantities, P x (x — y), ||x — and 
H^IIm respectively 

6 Probabilistic topological space 

In the previous sections, we have shown that given a symmetric affinity 
function w : V x V — > we can always define an Euclidean metric on V 
based on the first-access properties of the standard stochastic process, the 
random walks defined on the set V with respect to the matrix w xy > 0. 

In particular, we can introduce this metric on any undirected graph 
G(V, E) converting it in a metric space. The Euclidean distance interpreted 
as the commute time induces the metric topology on G(V,E). Namely, we 
define the open metric ball of radius r about any point x G V as the set 

B r (x) = {yeV :d(x,y)<r}. (21) 
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These open balls generate a topology on V, making it a topological space. A 
set U in the metric space is open if and only if for every point x 6 U there 
exists and e > such that B £ {r) C U, |7j. Explicitly, a subset of V is called 
open if it is a union of (finitely or infinitely many) open balls. 

7 Conclusion and examples 

Systems consisting of many individual units that are tied by one or more 
specific types of interdependency are found everywhere in the world. Net- 
works are often very complex and difficult to analyze. Being of rather large 
scale to be seen from a single viewpoint, they can often be abstracted as 
graphs, the natural mathematical tool for facilitating the analysis. 

Euclidean space has a decisive role in visual and propriomotor percepts, 
and in hearing thus determining our spatial perception. In addition, many 
of our feelings, of anger, fear and so on, have important links with parts of 
the body and hence indirectly with Euclidean space. In general, networks do 
not possess the structure of Euclidean space. Thus, a mental representation 
of any network emerges as a result of a long learning process jointly with the 
planning of movements in that and is always challenging to make the proper 
decisions on how to sustain the system and to cope with the new demand. 

In our paper, we have demonstrated that networks do possess the struc- 
ture of Euclidean space, but in the probabilistic sense. 

In order to illustrate the approach, we have shown the probabilistic images 
of a chain (ID-lattice of N = 100 nodes) (see Fig. 2(1,2)), a polyhedron (a 
cycle of iV = 50 nodes) (see Fig. 2(3,4)), and a 2D-lattice £2 containing 10 2 
nodes (see Fig. 3). We suppose that all weights are equal w xy = 1, so that the 
respective affinity matrices are just the adjacency matrices of the graphs. 

Random walks defined on the above networks embed them into the (N — 
l)-dimensional locus of Euclidean space, in which all nodes acquire certain 
norms quantified by the first-passage times to them from randomly chosen 
nodes. Indeed, the structure of (N — l)-dimensional vector spaces induced 
by random walks cannot be represented visually. 

In order to obtain a 3D visual representation of these graphs, we have cal- 
culated their three major eigenvectors {ip2, ^3, tpi}, belonging to the largest 
eigenvalues < 1 of the symmetric transition operators (jUJ). The (ei, e 2 , e 3 )- 
coordinates of the vertex x G V of the graph in 3D space have been taken 
equal to the relevant x th -components of three eigenvectors {i[)<2, i>3, V^}- The 
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Figure 2: Probabilistic images of a chain (N = 100) and a polyhedron 
(N = 50). 

radiuses of balls representing nodes in Figs. 2,3 have been taken proportional 
to the degrees of nodes. In Figs. 2.(1,3) and in Fig. 3(1), we have presented 
the the 3D images of the the chain, polyhedron , and the lattice £2- The 
connections between nodes represent the actual connections between them 
in the real space. 

If we choose one node of a graph as a point of reference, we can draw the 
2-dimensional projection of the (iV — l)-dimensional locus by arranging other 
nodes at the distances calculated accordingly to ffTS]) and under the angles 
( [15]) they are with respect to the chosen reference node. The examples are 
given in Fig. 2 (2,4). 
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Figure 3: 1. Probabilistic image of a 2D-lattice (N = 10 2 ). 2. The matrix 
plot of the probabilistic Euclidean distances (the commute times) between 
the nodes of the lattice £ 2 . 

In particular, in Fig.2(2), we have presented the 2-dimensional projection 
of the 99-dimensional Euclidean locus for a chain with respect to the marginal 
left node. The probabilistic Euclidean distance measured by the commute 
time ([TBI from the left end node is increasing node by node approximately 
as oc y/N from 18.166 random steps for the nearest neighbor node to 180.748 
random steps for the node at the opposite end of the chain (x = 100). In 
Fig. 2 (4), the similar diagram is represented for the polyhedron. It is worth to 
mention that the symmetry of the polyhedron can also be seen in the image 
of its 49— dimensional probabilistic locus. Chosen a vertex of the polyhedron 
as the reference node, the commute time with its nearest neighbors equals 
9.899 random steps, while it takes in average 35.355 random steps in order 
to commute with the vertex on the circumcircle diametrically opposite to the 
origin. 

In Fig. 3(2), we have shown the matrix plot of the probabilistic Euclidean 
distances (the commute times) between all nodes of the lattice £ 2 - The 
distances on the diagonal d(x, x) = 0, and vary harmonically from 15 to 30 
random steps for different pairs of nodes. 
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